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Abstract. We study the physical properties of the Lyman-a forest by applying the inversion method described by 
Pichon et al. (2001) to the high resolution and high S/N ratio spectrum of the Zcm ~ 2.40 quasar HE 1122—1628 
obtained during Science Verification of UVES at the VLT. We compare the column densities obtained with the 
new fitting procedure with those derived using standard Voigt profile methods. The agreement is good and gives 
. confidence in the new description of the Lyman-a forest as a continuous field as derived from our method. We show 

' that the observed number density of lines with log A*' > 13 and 14 is, respectively, 50 and 250 per unit redshift 

at z ~ 2. We study the physical state of the gas, neglecting peculiar velocities, assuming a relation between the 
overdensity and the temperature. T = T{p{x)/ p)"^^ . There is an intrinsic degeneracy between the parameters 
f3 and T. We demonstrate that, at a fixed /?, the temperature at mean density, T, can be uniquely extracted, 
however. While applying the method to HE 1122-1628, we conclude that for 0.2 < /3 < 0.3, 6000 < T < 15000 K 
at 2 ~ 2. We investigate the small-scale structure of strong absorption lines using the information derived from 
O [ the Lyman-/?, Lyman-7 and C iv profiles. Introducing the Lyman-/? line in the fit allows us to reconstruct the 

density field up to p/p ~ 10 instead of 5 for the Lyman-a line only. The neutral hydrogen density is of the order 
^ ' of ~2xl0^^ cm~^ and the C iv/H I ratio varies from about 0.001 to 0.01 within the complexes of total column 

density 7V(H l) ~ 10^^ cm~^. Such numbers are expected for photo-ionized gas of density wh ~ 10"'* cm~^ 
and [C/H] ~ —2.5. There may be small velocity shifts (~10 km s~*) between the peaks in the C iv and H I 
density profiles. Although the statistics is small, it seems that C iv/H I and nni are anti-correlated. This could 
be a consequence of the high sensitivity of the C iv/H I ratio to temperature. The presence of associated O vi 
' absorption, with a similar profile, confirms that the gas is photo-ionized and at a temperature of T ~ 10^ K. 

Key words. Methods: data analysis - Methods: N-body simulations - Methods: statistical - Galaxies: intergalactic 
medium - Galaxies: quasars: absorption lines - Gosmology: dark matter 



1. Introduction 

The numerous absorption lines seen in the spectra of dis- 
tant quasars (the so-called Lyman-a forest) reveal the in- 
tergalactic medium (IGM) up to redshifts larger than 5. It 
is believed that the space distribution of the gas traces the 
potential wells of the dark matter. Indeed, recent numer- 
ical iV-body simulations have been successful at repro- 
ducing the observed characteristics of the Lyman-a for- 
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est (Cen et al. 1994, Petitjean et al. 1995, Hernquist et 
al. 1996, Zhang et al. 1995, Miicket et al. 1996, Miralda- 
Escude et al. 1996, Bond & Wadsley 1998). The IGM is 
therefore seen as a smooth pervasive medium which can be 
used to study the spatial distribution of the mass on scales 
larger than the Jeans' length. This idea is reinforced by 
observations of multiple lines of sight. It is observed that 
the Lyman-a forest is fairly homogeneous on scales smaller 
than 100 kpc (Smette et al. 1995, Impey et al. 1996) and 
highly correlated on scales up to one megaparsec (Dinshaw 
et al. 1995, Fang et al. 1996, Petitjean et al. 1998, Crotts 
& Fang 1998, D'Odorico et al. 1998, Young et al. 2001). 
The number of known suitable multiple lines of sight is 
small, however, and the sample need to be significantly 
enlarged before any firm conclusion can be drawn. 
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The standard method to analyze the observed Lyman- 
a forest is to fit the spectra as a superposition of Voigt 
profiles (e.g. Carsweh et al. 1987). The IGM is then de- 
scribed as a juxtaposition of discrete clouds rather than 
a continuous field. Very recently, new methods have been 
implemented to recover the real space density distribution 
of the IGM by inversion of the Lyman-a forest. Nusser & 
Haehnelt (1999, 2000) proposed an iterative scheme based 
on Lucy's deconvolution method (Lucy 1974). Pichon et 
al. (2001) used a general non-linear explicit Bayesian de- 
convolution method which offers the possibility to add as 
much a priori information as is available. The method has 
been implemented in order to recover the 3D topology of 
the large-scale structures when applied to the inversion of 
a network of adjacent lines of sight. 

Here we apply this method to invert a high S/N ratio 
and high resolution spectrum of HE 1122—1628 obtained 
with UVES at ESO and therefore investigate the physi- 
cal conditions of the IGM at z ~ 2. Data are presented 
in Section H. The assumed model for the Lyman absorp- 
tion is described in Section pi The method is sketched 
in Appendix B. In Section ^ we investigate how the 
method can be used to constrain the temperature-density 
relation in the IGM and apply it to HE 1122-1628. 
Section ^ is dedicated to a comparison between a tradi- 
tionnal Voigt profile decomposition and our method. In 
Section ^ we use the additional information provided by 
higher Lyman series absorption profiles to deconvolve the 
saturated Lyman-a lines. We use this information to in- 
vestigate the C iv/H I ratio through high density profiles . 
We finally draw our conclusions in Section ^ 

2. Data 

The my = 15.5 and Zom = 2.40 quasar HE 1122-1628 was 
observed during Science Verification of the Ultra-violet 
and Visible Echelle Spectrograph mounted on the 8.2 m 
Kuyen telescope of the European Southern Observatory, 
mount Paranal (Chile). The spectrum, made public by 
ESO, was reduced and normalized by Cedric Ledoux us- 
ing MIDAS, the ESO data reduction package and follow- 
ing the pipeline step by step. Metals were identified and 
flagged when redshifted in the Lyman-a forest before any 
fitting procedure was applied. Table A.l gives the results 
(position, Doppler parameter and column densities) of fit- 
ting the Lyman-a forest of HE 1122-1622 with two meth- 
ods: (i) an automatic Voigt profile procedure (Carswell et 
al. 1987) and (ii) the Bayesian inversion method (see be- 
low and Pichon et al. 2001 for details). The redshift range 
probed by the spectrum is 1.86 — 2.38 for the Lyman-a for- 
est and 2. — 2.38 for the Lyman-a and Lyman-/? forests. 
In the course of the study, we use three synthetic spectra, 
Fi , F2 and F3 . The spectra were generated analytically ap- 
plying the procedure described by Bi & Davidsen (1997) 
and using a temperature-density relation (see Eq. |^) with 
/3 = 0.25 and T ^ 10 000, 20 000 and 30 000 K, respec- 
tively (see Choudhury et al. 2001). Photon and pixel noise 
is added in such a way that the signal-to-noise ratio is 



50, corresponding approximately to the quality of the ob- 
served spectrum. 

3. The Bayesian inversion method 

We inverse the observed Lyman-a forest for the density 
field using a Bayesian method. This method is a generali- 
sation of Wiener filtering, which is capable of dealing with 
a non-linear model for the Lyman-a absorption, including 
for instance the effects of gas temperature and of peculiar 
motions. In this study, we assume the model described 
below, in which peculiar motions are neglected. 
The optical depth, r, in the Lyman-a forest is 



t{w) 



CCTq 



■ exp 



where c is the velocity of light, do is the Lyman-a absorp- 
tion cross section, H is the Hubble constant at redshift 
z, rifji the H I density field and b the Doppler parame- 
ter. Lo and X are velocities (in km s^^). The observed flux, 
F, is simply F — exp(— r). We use i?o — 75 km/s/Mpc 
throughout the paper. 

Following Hui & Gnedin (1997), we assume a temperature- 
density relation 



T 



f /Odm(x) 



7-1 



(2) 



where T is the temperature at mean dark matter density, 
Pdm , and 7 — 1 is the polytropic index. And therefore, 



&(x) = 13 kms 



-1 / T / /3dm(x) 
V lO^X ^ pDM 



(3) 



where 7 — 1 = 2/3. 

A simple functional relation between the dark matter den- 
sity pdm and the neutral hydrogen particle density tt-hi is 
assumed. 



nHi(x) = nm 



( /3dm(x) 



V PDM 

where a — 2-1 A x /?. Eq. ^ reads 



(4) 



t{w) — A{z)ci 



X exp -C2 



{w — xY 



dx . 



(5) 



The parameters ci, C2 and A{z) depends on the charac- 
teristic temperature of the IGM 



T 



, C2 = 13 



and A{z) 



(6) 



where J is the ionizing flux, assumed to be uniform. A{z) 
is chosen in order to match the observed average optical 
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depth of the Lyman-a forest (~ 0.2 at z = 2). In the case 
of HE 1122-1628, we measure A{z) = 0.22 in the UVES 
spectrum and therefore use this value in the following. 
We assume that the a priori density obeys a lognormal dis- 
tribution characterised by its a priori correlation matrix. 
We plot in Fig. |l| the PDF of the analytical density field 
used to generate spectrum Fi (solid line; the peculiar ve- 
locities are not considered) together with the PDF recon- 
structed after inversion of Fi (dashed and dotted lines for 
S/N ratios of 50 and 1000 respectively). Overplotted as a 
dashed line, but shifted by a factor of ten for clarity, is the 
PDF for the HE 1122-168 spectrum. This justifies the log- 
normal approximation. The discrepancy beyond p/ p ^ 1 
is due to redshift distortion (see Nusser et al., 2000). 
The sought parameters, M, are the logarithm of the con- 
tinuous field pdm/pdm (named p in the following), to- 
gether with the two parameters of Eq. ^, T and f3. The 
priors of the inversion are described by the above equa- 
tions |[|3[ |[ plus a probability distribution of M, involving 
a prior guess Mq and a correlation function Co for p 

Co.., X exp(^-i^i^^) . (7) 

In this work, Mq = (po = 0, T and /3 fixed). The model 
that relates the data D and the parameters M is given by 
Eq. I 

An example of the reconstruction is given in Fig. |^ which 
shows part of the spectrum of HE 1122—1628 (upper 
panel) and the reconstructed density (lower panel). 

4. Constraining the temperature at mean density 

The method described in Section |^ is applied to the spec- 
trum of HE 1122—1628 to constrain the parameters of 
the temperature-density relation : the exponent /? and the 
temperature at the mean density T (see Eq. ||). The inver- 
sion is performed for a grid of f3 and T values. Following 
Hui & Gnedin (1997), we restrict our study to f3 in the 
range 0.2-0.3. 

4.1. Application to synthetic spectra 

4.1.1. Definition of a borderline in the (/3,T) plane 

We first try to recover the initial couple {f3o,To) used 
to generate one of the synthetic spectra (see Section 
For this, we fix a couple (/3,T) and inverse the spectrum 
for the density field. We derive the minimum reduced 

value obtained by fitting the model to the data, i.e. 
(D - g{M))^.C^\{T> - 5(M)), see Section | The resuh 
of fitting the synthetic spectrum Fi is shown in Fig. ||. The 
(/3,T) plane is divided into two regions with, respectively, 

< 1 and > 1; separated by a well-defined approxi- 
mately straight borderline. The correct value of T lies on 
the = 1 borderline. This is expected as explained in 
Appendix A. 

We have investigated the dependance of the position of 
this borderline with respect to the parameters of the in- 
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Fig. 1. Simulated and reconstructed probability 
Distribution Function (PDF), p P{log{p)), of the 
over-density p/p. The simulated density PDF is plotted 
as a solid line. From this density, a spectrum has been gen- 
erated in real space and thermally broadened. Different 
amounts of noise have been added. The density field is 
then recovered using the inversion technique described 
in Section |[ The dotted and dashed lines correspond to 
the PDF of the density recovered from spectra with a 
S/N ratio of 50 and 1000, respectively. The two PDFs 
are approximately the same except for the low density 
region. Noise induces a lower and an upper cutoff in the 
recovered PDF. The PDF in the redshift-space spectrum 
is shown as a dot-dashed line. The PDF recovered from 
the UVES spectrum scaled up by a factor of 10 for clarity, 
is shown as another dashed line. The density PDF in the 
simulation is plotted with the same scaling, upper solid 
line, to aid the comparison. 



version, namely the correlation amplitude (cp) and corre- 
lation length (^r) of the prior matrix Cq. should be of 
the order of the spectral resolution (here, the pixel cor- 
responds to 0.05 A or 0.047 Mpc comoving) ; cTp con- 
trols the fluctuations of the density between two itera- 
tions. CTp should be neither too small to allow for the re- 
covery of the high density peaks, nor too large for sta- 
bility of the algorithm. It can be seen in Fig. ^ that for 
0.2 < cTp < 0.5 and 0.12 < ^^(Mpc c_om.) < 0.55, the 
= 1 border corresponds to the same T within 10%. We 
use (Tp = 0.25 and = 0.2. 

Note, however, that a high x^ value can be the conse- 
quence of only a small portion of the spectrum beeing 
poorly fitted. This implies that, when fitting real data, 
the borderline can be considered as a lower limit for the 
temperature at fixed /?. 

It remains that the inversion problem is degenerate: it is 
always possible to find a good fit to the data with any 
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Fig. 2. Portion of the spectrum of HE 1122—1628 (upper panel) and corresponding reconstructed DM density (bottom 
panel) when peculiar velocities are not considered. The vertical marks indicate the positions of individual components 
found by the automatic Voigt profile fitting procedure. The S/N ratio in the data is of the order of 60 per pixel. 



given value of /3 provided T is changed. We have tried to 
find a systematic approach to be able to recover from the 
synthetic data the temperature at mean density T that 
was used to create the data. 



4.1.2. Where is the strongest constraint on T ? 

Zaldarriaga et al. (2000) have shown that the flux power 
spectrum is affected by changes in the temperature only 
for overdensities between 0.9 and 1.8. ft is likely that not 
all the absorption features are well suited to constrain the 
temperature and one can wonder which features constrain 
the temperature best. Indeed, low density regions are asso- 
ciated with diffuse gas in voids where temperature broad- 
ening is washed out by the smoothness of the density field 
itself. This suggests that the strongest constraint on the 
temperature determination comes from moderately satu- 
rated lines. 

To check this, we divided the spectrum in sub-spectra, 
of typical width 20 A. In each sub-spectrum we define the 
maximum optical depth, Tmaxj corresponding to a normal- 
ized flux Fmin- As argued previously, it is expected that 
the sub-spectra where Fmin is large will poorly constrain 
the temperature. We therefore limit the inversion to sub- 



spectra where some condition on Faiin is imposed. We have 
checked that the width of the sub-spectra is not a criti- 
cal parameter for the procedure. It must be large enough 
to include an important range of apparent optical depths 
but not too large, in order to keep enough weight on large 
optical depths. 



Fig. H shows the results of the inversion of the sub-spectra 
of the synthetic spectrum Fl, varying T while keeping 
(3 equal to 0.25 (which is the correct initial value). The 
reduced is plotted versus the mean temperature for 
^niin < 0.1 (only strong features are considered; results are 
shown as open squares in the figure) and for 0.2 < Fy^in < 
0.3 (lines of intermediate saturation only are considered; 
results are shown as filled triangles in the figure). It is 
apparent that, for weak lines, remains smaller than one 
for a very wide range of temperatures. Instead, for strong 
lines, is larger than one for all temperatures larger than 
the correct value T — lO"' K. Note that a normalized flux 
of 0.2 corresponds to an overdensity of p/p ^ 2. Because 
of this behaviour, only sub-spectra with Fmin < 0.2 are 
used in the following to constrain T. 




Fig. 3. Result of the fit of a synthetic spectrum con- 
structed from an analytical simulation and assuming 
an temperature-density relation defined by Eq. ^ with 
Po = 0.25 and To = 10 000 K (spectrum Fi, see Section |). 
The inversion is performed for the density field, for a 
grid of couples (/3,r), and the final minimum reduced 
is plotted versus (3 and T. The plane corresponding 
to = 1 is shown with dotted lines. The locus of the 
points (/3,T,x^=l) is what is refered to as the borderline 
in the text. The correct couple (/3o, Tq), marked by a filled 
square, is located on this borderline. 



4.1.3. Estimation of the temperature T 

The reduced is computed for each sub-spectrum, la- 
belled i, and each couple {/3,T). This defines the tem- 
perature on the borderline, Tjj^, for each sub-spectrum. 
The highest T^.j correspond to segments where the den- 
sity field is so smoothed that the effect of temperature is 
very small. This introduces a systematic bias against low 
temperatures. 

Using the synthetic spectrum, we check that, for the 
correct value of P, an appropriate estimator of To is the 
median of the 1*^' quartile of the cumulative distribution, 
P(T), defined as the region where 0. < P(T) < 0.25. This 
is illustrated in Fig. || where this estimator of To is plotted 
versus /3 for the three synthetic spectra Fl (10 ODD K), F2 
(20 000 K) and F3 (30 000 K). The error bars represent 
the range of temperatures in the I*'' quartile. The three 
initial temperatures are recovered with this estimator for 
the correct value of /3 = 0.25. 

4.2. Application to HE 1122-1628 and discussion 

The same procedure is applied to the UVES spectrum of 
HE 1122—1628. It can be seen in Fig. ^ (lower panel) that, 



log(T) 

Fig. 4. Result of the fit of a synthetic spectrum con- 
structed from a simple ID density field (Fig A. 2) and as- 
suming a temperature-density relation defined by Eq. 
with Po = 0.25 and To = 10 000 K. The inversion is per- 
formed for the density field, for different values of T while 
/3 = /3o is fixed. The final minimum reduced is plotted 
versus T. The horizontal and vertical dotted lines corre- 
spond to = 1 and T = Tq respectively. The intersection 
of the curves with the x^=l lin^ defines the borderline. 
The different curves correspond to different values of the 
priors as defined by the matrix Co, i.e. its variance ap 
and correlation length, (see Section ||). = 0.12 — 0.55 
Mpc comoving (top to bottom panel) and ap — 0.2 — 0.5 
(solid and dashed lines). In this range, the value of T on 
the borderline is constant within 10%. 



as already noted with analytical data, the fit of strong lines 
is more constraining than the fit of weak lines. 
Results on the determination of T along the line of sight 
to HE 1122-1628 is given in Fig. 0. This figure (when 
compared to Fig. ^ suggests that the temperature of the 
IGM at mean density at redshift z ~ 2 is only slightly 
larger than lO'' K. Accounting for the uncertainty due to 
the inherent lack of knowledge on /3, we conclude that 
T - lOOOO^g^Sgo K at 2 = 2. 

Other methods have been applied to determine the 
temperature-density relation of the IGM over the past few 
years. Schaye et al. (2000), Ricotti et al. (2000) and Mc 
Donald et al. (2000) deconvolve the spectra using Voigt 
profile fitting. They then consider that the temperature 
can be estimated from the lower bound of the locus of 
observed values of the Doppler parameter and neutral hy- 
drogen column density. This is reminiscent of the reason- 
ing used by Pettini et al. (1990). The three groups de- 
rive similar results. For an average redshift of 2.4, they 
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Fig. 5. Minimum reduced versus log T obtained af- 
ter fitting 20 A wide portions of the synthetic spectrum 
Fi (upper panel) or of the HE 1128—1628 spectrum (lower 
panel). The portions are chosen so that the minimum value 
of the normalized flux is smaller than 0.1 (open squares) or 
is in the range 0.2—0.3 (filled triangles). The reconstruc- 
tion is performed for f3 = 0.25. Note that in the simulation 
(upper panel), T = 10000 K. 
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Fig. 6. Recovered parameters of the temperature-density 
relation for different analytical spectra. Each of the 
spectra are generated using the same density field and 
P — 0.25 but with three different temperatures: T — 
10 000 K (circles), 20 000 K (squares), 30 000 K (crosses). 
Results from inversions are shown as open symbols; the 
initial parameters are shown as filled symbols. See text for 
the definition of the estimator. The largest uncertainty in 
the determination of the absolute temperature from the 
inversion comes from the degeneracy between /3 and T. 



find, respectively, T/W^ = 0.8 - 2,_(3 = 0.25 ± 0.07; 
T/lO'i = 0.7 - 5 and /3 = - 0.3 and = 1.74 ± 0.19 

and P = 0.26 ± 0.07 . As emphasized by Zaldarriaga et 
al. (2000), these authors use a lower limit of the width 
of the lines over the whole spectrum. They therefore de- 
rive a lower limit on the temperature. Another approach 
that does not make use of any fitting procedure has been 
developed by Zaldarriaga et al. (2000). It relies on the 
determination of the flux power-spectrum. These authors 
show that there is no power on scales smaller than the 
scale corresponding to thermal broadening 6t. They find, 
for the same redshift, T(/3 = 0)/10'' = 1.8 - 3.2 and 
T(/3 = 0.3)/10'' = 1. - 1.6. AU analyses show a depen- 
dance of T over /3 similar to that shown in Fig. 0. 
In addition, Hui & Gnedin (1997) conclude from the re- 
sults of hydro-simulations that T and f3 reach an asymp- 

/ 2 

totic value at low redshift: T oc — and 

/3>0.15 — 0.24, depending on the exact reionization 
history. This corresponds approximately to a temperature 
at overdensity unity that decreases from about 10 000 K 
at z = 4 to 8000 K at z = 2. Note however that they did 
not include the effects of He ii reionisation. 



Our findings are therefore consistent with previous pub- 
lished analysis. Note that the remaining discrepancies in 
the results obtained by the various authors could be a 
consequence of the fact that a single temperature-density 
relation is assumed for the gas. In particular, it is probable 
that spatial variations of the ionizing flux can have a sub- 
stantial influence on the temperature (see e.g. Zaldarriaga 
2001). It should be possible to investigate this in more 
detail using our procedure if a large number of lines of 
sight is observed at the same resolution and S/N ratio. 
Such data will be available after a few years of observa- 
tions with UVES at the VLT. Moreover, the determina- 
tion of the characteristic temperature of the IGM over a 
large redshift range could constrain the reionisation his- 
tory. Indeed, Schaye et al. (2000) have tentatively found 
a peak of T = 10"' '* K at a redshift of 3.2 (compared to 
about lO'' K at redshifts 3.5 and 2) and interpret this as 
evidence for He ii reionisation. 

5. Comparison with Voigt profile fitting 

In this Section, we compare the column densities evalu- 
ated along the line of sight to HE 1122—1628 from the fit of 
the Lyman-Qf forest using the inversion method on the one 
hand and VPFIT, the automatic Voigt profile procedure 
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Fig. 7. Parameters of the temperature-density relation re- 
covered by inversion of the UVES spectrum of HE 1122- 
1628. The temperature at mean density at z ~ 2 is found 
in the range 6 000—15000 K. The trend shown in this fig- 
ure is close to that found in the Fl spectrum (see Fig. ^ 
where T=10 000 K. 



(Carswell et al. 1987) on the other hand. The latter proce- 
dure decomposes the spectrum in discrete absorption com- 
ponents, whose Doppler parameters and column densities 
are fitted to the data. Our method derives the density in 
each velocity pixel and we must integrate this density over 
some velocity range, chosen somewhat arbitrarily, to re- 
cover a column density and compare it to VPFIT's. At 
the position of each cloud defined by VPFIT, we integrate 
over twice the Doppler parameter of the cloud. Table A.l 
summarizes the results. 

Fig. H shows the histograms of the values taken by the 
ratio between the two column densities when only the 
Lyman-a line is considered (solid line) and when the 
Lyman-/3 line is used in addition to constrain the fit 
(dashed line). The mean difference between the two es- 
timations is about 15%, well within the uncertainties due 
to the measurements themselves. The inversion method 
usually underestimates the high column densities. This is 
a consequence of the lack of information in the saturated 
part of the corresponding absorption lines. In addition, 
the fact that the procedure reconstructs a continuous field 
minimizes the importance of the wings of the line. The sit- 
uation is improved, as expected, when the Lyman— /3 line 
is used to constrain the fit. The mean difference is then 
reduced to about 10%. 

The improvement in the inversion of the density field when 
including the Lyman-/3 forest is emphasized in Fig. |. We 
inverse one of the synthetic spectra for which we know 
the real overdensity in each pixel. Fig. gives the recon- 



structed overdensity versus the overdensity used to simu- 
late the spectrum. When the Lyman-a line only is used, 
the density field is recovered up to overdensities slightly 
larger than 5, whereas when the Lyman-/? forest is used 
in addition, overdensities up to 10 are recovered. 
Fig. ^ shows the H i column density distribution derived 
from the inversion of the Lyman-a and Lyman-/? forests 
observed toward HE 1122—1628 together with the results 
from the automatic fitting of the Lyman-a forest using 
VPFIT and the data points from Kim et al. (1997). A cor- 
rection has been applied at the low column density end to 
take into account incompleteness induced by the blending 
of weak lines with strong ones (Kim et al. 1997). There 
is a remarkable agreement between the inversion method, 
VPFIT and previous observations. 

The number of absorption lines per unit redshift is given in 
Fig. |ll| as a function of redshift. Although previous anal- 
yses have concentrated on strong lines, log A^(H i) > 14, 
most of the constraints on the IGM come from weaker 
lines. Our results at z ~ 2 for log A^(H i) > 13 and 
log 7V(H i) > 14, together with other observational points 
(Giallongo et al. 1996, Kim et al. 1997, Weymann et al. 
1998, Savaglio et al. 1999, Kim et al. 2000), nicely fall on 
top of the predictions by Riediger et al. (1998). 

In this Section, we have not yet taken advantage of the 
true capabilities of the Bayesian method. Indeed, we first 
inverted the spectrum to a density field and then repro- 
jected the result into discrete clouds. In the following, we 
concentrate on other questions, taking full advantage of 
the new method. 

6. Structures in the strong lines 

As long as only unsaturated lines are considered, the corre- 
sponding structures are clearly identified in the spectrum. 
The same structures are lost, however, when blended with 
saturated lines whose width is often of the order of 
100 km s~^ (see below). The nature of the strong lines 
(log /V(H i) > 14) is unclear and they could be associ- 
ated with filaments, halos with rotation velocities less than 
about 100 km s~^ and/or external regions of galactic ha- 
los (see e.g. Miicket et al. 1996). Observations of adjacent 
lines of sight have concluded that these lines arise in com- 
plexes with transverse dimensions larger than 200 kpc. To 
gain insight into the small-scale structure of these com- 
plexes, we can try to invert the corresponding Lyman-a 
lines, adding information from all observed lines in the 
Lyman series and when available from the associated C iv 
lines. The inversion is performed in redshift space. 



6.1. Inversion of strong lines 

We have selected for further analysis the 12 lines 
seen in the spectrum of HE 1122—1628 which have 
log N(R i) > 14 and a large enough redshift {zahs > 2.03) 
so that the Lyman-/? line is redshifted in the observed 
wavelength range. A more accurate Voigt profile fit has 
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Fig. 8. Histogram of the difference between the logarithm 
of the H I column densities obtained using the Voigt pro- 
file fitting procedure (Nfn) on the one hand, and the inver- 
sion method (A'loc) on the other hand, from the fit of the 
Lyman-a forest only (solid line) or for both the Lyman- 
a and Lyman-/? forests (dashed line), evaluated along the 
line of sight to HE 1122-1628. For the inversion procedure, 
the column densities are estimated by integrating the re- 
covered density over twice the Doppler parameter found 
by the Voigt profile fit. 

been performed on these individual lines, including when- 
ever possible other lines in the Lyman series (mostly 
Lyman-/3 and Lyman-7). Also, we have inverted the flux 
with the Bayesian method including all pieces of informa- 
tion available. Since the associated C iv lines are most of 
the time too weak, we do not include the C iv profile as 
a constraint for the inversion and keep it instead for an 
a-posteriori discussion of the C iv/H i ratio. 

In principle, the Lyman-/? line can always be blended with 
an intervening Lyman-a line at lower redshift. We there- 
fore define 



TLyp = min(T2/3, 



(8) 



where r^/j and Tza are the observed optical depth at the 
position of the Lyman-/? and Lyman-a absorptions respec- 
tively; fa, fp, Aq and A/3 are the oscillator strengths and 
rest-wavelengths of the Lyman-a and Lyman-/? lines. We 
then invert r^yp and r^ya simultaneously (with the same 
relative weight for the two lines). 

A few examples of the fits and reconstructions are given 
in Figs |l^,|l^,^ 18. From bottom to top, the different 
panels correspond to (i) the reconstructed density, (ii) 




Psim 

Fig. 9. Over-densities recovered after inversion of a syn- 
thetic spectrum versus the over-densities used to generate 
the synthetic spectrum, assuming T = IC^K,/? = 0.25. 
When only Lyman-a data is available for the inversion pro- 
cedure (filled squares), overdensities larger than 5 cannot be 
recovered because of saturation effects. This situation is very 
much improved when using the Lyman-/? forest data (open cir- 
cles) . 



the Lyman-a absorption profile, (iii) the Lyman-/? or the 
Lyman-7 profiles, (iv) the associated C ivAA1548,1550 
doublet and (v) when possible, the associated O vi ab- 
sorption. The fit models are overplotted on the observed 
Lyman profiles as dashed lines. The positions of the com- 
ponents used in the Voigt profile fits are marked by vertical 
lines. 



6.2. Validation 

The inversion procedure uses a correlation length to prop- 
agate the information along the line of sight. It is impor- 
tant to check the robustness of the results when changing 
this parameter. For this, we invert the Lyman-a structure 
at 4075 A different correlation lengths (see Fig. middle 
panel) . We use the Lyman-/? and Lyman-a profiles to con- 
strain the fit. It can be seen that for correlation lengths 
larger than the resolution of the spectrum (~0.1 A at this 
wavelength), the results are weakly dependent on the ex- 
act value of the parameter. It is only for very small corre- 
lation lengths (the pixel size is 0.05 A), that the structures 
become artificially peaked. In the following, we always use 
a correlation length of 0.2 A. 

As the temperature-density relation is determined for the 
whole spectrum, it is possible that the global solution 
is not representative of the physical state of the gas in 
one particular place. To estimate the influence of a possi- 
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Fig. 10. Column density distribution at z ~ 2.3, cor- 
rected for incompleteness, as measured toward HE 1122- 
1628 from the inversion procedure (crosses) and the Voigt 
profile procedure (squares). Also plotted is the distribu- 
tion obtained by Kim et al. (1997, triangles). 

ble variation of the temperature-density relation from one 
place to the other, we have inverted the same Lyman-a 
feature at 4075 A using three different values of the cou- 
ple (/3,T), found on the borderline defined in Section ^. 
Note that the values used for the three inversions are ex- 
treme and correspond to variations of a factor of three in 



(3 and T. It can be seen in Fig. 14 (bottom panel) that al- 
though the absolute amplitude of the density field varies 
by a factor of ~2 from one inversion to the other, the 
shapes of the density profiles are very similar and all the 
structures are recovered. 



0.4 

Log(1+z) 

Fig. 11. Number density of lines versus redshift for dif- 
ferent column density thresholds: log 7V(H i) > 13 (Kim 
et al. 1997, thick squares; Kim et al. 2000, thick circles; 
this work, thick losange), log A^(H i) > 14 (Savaglio et al. 
1999, squares); Weymann et al. 1998, crosses; Kim et al. 
1997, circles; Giallongo et al. 1996, triangles; this work, 
losange) . 



& Fang 1998). However, it must be noted that the radius 
derived from analysis of the metal lines associated with 
the Lyman-a forest are much smaller (Hu et al. 1995; see 
next Section). 



6.3. Correlation function 

The two-point correlation function is defined as, 

C(Aw) =< SF{v)SF{v + Av)>. (9) 

The function computed for the density field reconstructed 
from the inversion of the HE 1122—1628 spectrum is plot- 
ted in Fig. |l^ together with the correlation function of 
the flux. The curves are normalized to ^(0) = 1. To com- 
pute errors, we used the bootstrap method presented in, 
e.g., McDonald et al. (1999) and found that the correla- 
tion function is not corrupted by the amount of noise. On 
small scales, the shape of the function is probably related 
to the internal structure of the Lyman-a clouds (or more 
realistically Lyman-a complexes) , the dimension of which 
is expected from A^-body simulations to be of the order of 
1 Mpc (~ 130 km s~^). Observations of adjacent lines of 
sight also indicate that the absorptions are correlated on 
scales of the order of 0.2 to 1 Mpc (e.g. Petitjean et al. 
1998, D'Odorico et al. 1998, Dinshaw et al. 1998, Crotts 



6.4. The Cw/H I ratio 

The determination of the amount of metals present in the 
Lyman-a forest attracts much interest because this is a 
key issue for understanding how the first objects in the 
Universe were formed. Since the first observations of met- 
als (mostly C iv) in at least half of the Lyman-a lines 
with log N(E i) > 14.3 (Cowie et al. 1995, Songaila & 
Cowie 1996), efforts have been concentrated towards con- 
straining the metal content of the gas with smaller col- 
umn densities. As direct detection of the corresponding 
very weak C iv lines is difficult even if the physical condi- 
tions are the same at different column densities, a statis- 
tical approach has been used. Ellison et al. (1999, 2000) 
conclude that observations are consistent with a constant 
C iv/H I ~ 2 X 10"'^ ratio down to very small column densi- 
ties. Deriving a carbon metallicity from this is quite uncer- 
tain, however. Using hydro-simulations including photo- 
ionization, Rauch et al. (1997) claim that at 2 = 3, the 
column density ratios of the different ionic species can be 
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Fig. 12. Example of the inversion of a saturated line, 
using both the Lyman-a and Lyman-/? profiles. The 
temperature-density relation used during inversion is de- 
fined by /3 = 0.25 and T = 8000 K (Eq. |). The re- 
covered density is shown in the bottom panel. The other 
panels shows from bottom to top the Lyman-a, Lyman- 
P, C IVA1548 and C ivA1550 profiles. The reconstructed 
flux after inversion is overplotted as dashed lines. Note 
that the blending of the Lyman-/? forest is ignored by the 
inversion. Vertical lines show the position of components 
identified by the Voigt profile fitting procedure. 



well reproduced if a mean metallicity of [Z/H] ~ —2.5 is 
assumed. 

Out of the 12 clouds with A^(H i) > 10^"' cm~^ seen toward 
HE 1122-1628, 7 (60%) have associated C IV absorption. 
The average C iv/H i ratio computed after Voigt profile 
fitting of the absorptions is 4 x 10~^. The optical depth 
ratio tciv/thi, where tciv and thi are defined as in Eq. || 
for all available components is approcimately constant and 
corresponds to C iv/H i~ 2.5 10"'^ for thi > 1 and slightly 
smaller for thi < 1. 

It has already been noted by Songaila & Cowie (1996) 
that the C iv/H i ratio is not constant within a strong 
complex and seems to be larger in the wings of the strong 
lines. Using the inversion method we are in a position to 
study the variations of the C iv/H i ratio within the strong 
complexes. For this we must invert, not only the Lyman 
series as described above but also the C iv absorption. 
Unfortunately, due to the faintness of the C iv absorp- 
tions, the required S/N ratio is very large. Although the 
S/N ratio of the HE 1122—1628 spectrum is very good, it 




3894 3896 3898 3900 

wavelength (Lya) 



3902 



Fig. 13. Same as Fig. ^ for another system. Here T = 
11000 K. Note that the density is reconstructed even 
where the Lyman-a forest is saturated, thanks to the con- 
straints from the Lyman- /3 forest. 



is high enough only for the Zabs = 2.35 and ^abs = 2.37 sys- 
tems (see Figs. [l7| , p^ . We invert the absorptions for the 
H I density using Lyman-a, /3 for the 4076 A complex and 
Lyman-a, /3,7 for the 4096 A complex. The C iv density is 
obtained from the C IVAA1548,1550 doublet. The H i par- 
ticle density is of the order of 10~^ cm^^ which is what is 
expected from simulations (typically tih ~ 10~^ cm~^ and 
H i/H - 10-^ see e.g. Bi & Davidsen 1997, Hellsten et al. 
1997). It must be noted however that this number is ob- 
tained assuming that the density is constant over the pixel 
size, ~ 12 kpc at z = 2. The C iv/H i ratio varies from 
about 0.001 to 0.01 within the complexes. Finally, it can 
be seen that there are slight velocity shifts ('--^lO km s~^) 
between the peaks in the C iv and H i density profiles. 
The most interesting observation, however, is that there is 
some hint of an anti-correlation between C iv/H i and nni 
(see Figs. p^ , pj| upper panel where the C iv/H i ratio is 
plotted versus the H I density). It is difficult to assess the 
anti-correlation as the S /N ratio in the C iv profile is prob- 
ably not high enough and more important, the statistics 
are poor. However, it is apparent on the profiles that (i) 
the strongest H i components have C iv/H I ~ a few 10~^ 
(e.g. region A) and (ii) there is gas in the wing of the pro- 
files of lower H I optical depth and much larger C iv/H i ra- 
tio (e.g. region B). A straightforward interpretation would 



Emmanuel Rollinde, Patrick Petitjean, Christophe Pichon: Inversion of the HE 1122—1628 UVES spectrum 11 



1.0 



0.5 



4070 4075 4080 

b! I M M I ' I ' I ' I ' I ' I 'J 




O.O r I I I I 



J I I I L 




^^-■'--1 ./I.I.I 





4070 4072 4074 4076 4078 

wavelength (Lya) 

Fig. 14. Illustration of the influence of the a-priori corre- 
lation length (see text) and the temperature-density re- 
lation on the recovered density in the Lyman-a complex 
at A ~4076 A (see also Fig. |l^). Top panel: Lyman-a, 
Lyman-/3 and C iv absorption profiles are shown as, re- 
spectively, solid, dashed and dotted lines. Middle panel: 
recovered density for different correlation lengths, 0.05 
A (dashed line) and 0.1-0.2 A (solid line, both profiles 
are undistinguishable). Note that the pixel size is 0.05 
A. Bottom panel: recovered density for three different 
values of the temperature-density relation parameters; 
the three {P,T) couples are on the borderline defined in 
Fig. 0: solid line, /3 = 0.2, T = 15 000 K, dashed line, 
P = 0.25, T = 10 000 K, dotted line, /3 = 0.3, T = 5000 
K. 



be that the carbon metallicity is not homogeneous in the 
cloud. A more simple explanation however is that this is a 
consequence of temperature fluctuations. Indeed C iv/H i 
is quite sensitive to the temperature for T ~ 10^ K (see 
e.g. Rauch et al. 1997). Because of high temperature di- 
electronic recombinations, C iv/H i increases with tem- 
perature. If the anti-correlation is real, this implies that 
the temperature of the gas increases for decreasing H i 
particle density and is close to 10^ K. 
Interestingly enough, it can be noted that for coUisional 
ionization, C iv/C has a maximum ~0.35 at T ~ 10*^ K 
(Arnaud & Rothenflug 1986) and C iv/H i ~ 2x10"^ for 
[C/H] — 10^^'^. However, if coUisional ionization domi- 
nates, no O VI is expected in the g the dominant 
ionization stage of oxygen is O iv at this temperature. 



100.0 



1000. 
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Fig. 15. Two-point correlation functions of the flux in 
HE 1122-1628 at z - 2. (light sofid lines); the three 
lines correspond to the function itself and the function 
±1(7. The dashed lines correspond to the measurements by 
McDonald et al. (2000) at z = 3.89, 3.0 and 2.41 are shown 
as, respectively dashed, dotted and dash-dotted lines. The 
thick solid line is the function calculated for the recovered 
density. 



On the contrary, in the gas we observe, it can be seen in 
Figs. |l^,|l^ that O VI is indeed detected and has a similar 
apparent optical depth as C iv. Note that we plot in each 
case the line of the O vi doublet which is not blended. 
Moreover, as far as the S/N ratio allows the comparison, 
the C IV and O VI profiles are strongly correlated. This is 
a clear signature of the predominance of photo-ionization 
in the ionization balance of the gas. 



7. Conclusion 

We have applied the Bayesian inversion procedure de- 
velopped by Pichon et al. (2001) to the high resolution 
and high S/N ratio spectrum of HE 1122—1628 obtained 
during Science Verification of UVES at the VLT and 
made publicly available by ESO. Assuming a temperature- 



density relation T = T (^^f^) 



2/3 



= T f ^SmM ) for the gas, we invert 

the spectrum for the density field p{x) and recover a grid 
of versus the two parameters /3 and T. 
We certify the method by showing that the inversion pro- 
cedure applied to synthetic data enables us to infer the 
correct temperature when /3 is given the value used to 
generate the data. There is an intrinsic degeneracy how- 
ever between /3 and T which is the source of most of the 
uncertainty in the determination of the temperature of the 
intergalactic medium. We find T - 10000^i^[J[^° K at z - 2. 
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Fig. 16. Same as Fig. |l| for the complex at A4076 A. 
The C IV is strong enough to allow us to study the ratio 
C iv/H I (Fig. Note the shift between the H i and C iv 
strongest peaks (the position of the density peaks does not 
depend strongly on the parameters of the inversion, see 
Fig. Note also the presence of C iv in the wings of H i 
profile (at A 4076.5 A, marker B in Fig. The O vi 
profile is also shown in the upper panel. The similarity of 
the O VI and C iv profiles is a signature of predominance 
of photo-ionization in the ionization balance. 



The results of fitting the Lyman-a forest with the new 
method or with the automatic Voigt profile fitting pro- 
cedure VPFIT (Carswell et al. 1987), when expressed in 
terms of column densities, are found to be very similar. 
The new procedure however has the advantage of recon- 
structing the continuous density field. The latter can be 
recovered up to p/p ~ 5 when the Lyman-a forest only 
is used. Above this value, saturation prevents any reliable 
determination of the density. Overdensities up to 10 can be 
recovered however when the Lyman-/3 absorption is used. 
We investigate the structure of the complexes with 
log iV(H i) > 14 by inverting the Lyman-a and Lyman-/? 
lines and therefore derive the two-point correlation func- 
tion of the dark matter on scales smaller than 1 Mpc. Out 
of the twelve lines with log A^(H i) > 14, seven have de- 
tected associated C iv absorption. The ratio C iv/H i, 
derived from the optical depth ratio tciv/'^'hi, is roughly 
constant, ~ 2.5 10"'^ for thi > 1 and slightly smaller for 
thi < 1. In two of the complexes, we are able to inverse 
the C IV absorption independently. This reveals the pres- 
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Fig. 17. Determination of the C iv/H i ratio along the 
absorption profile of the strong complex at A4076 A (see 
also Fig. p^ . Bottom panel: Lyman-a (solid line), Lyman- 
/3 (dashed line) and C iv (scaled by a factor 4, dotted line) 
absorption profiles. Second panel: recovered H i (solid line) 
and C IV (scaled by the average C iv/H i ratio, dashed 
line) density (see text for details about the reconstruc- 
tion). Third panel: C iv/H i ratio; the horizontal line in- 
dicates the ratio of the C iv and H i integrated densities. 
Top panel: the C iv/H i ratio is plotted versus the H i 
density. The markers A and B correspond to positions 
discussed in the text. 

ence of fluctuations as large as one order of magnitude in 
the C iv/H I ratio. This, together with the presence of 
O VI with the same absorption profile as C iv indicates 
that the gas is photo-ionized and at a temperature close 
to 10^ K. 

Throughout this work, we did not take into account red- 
shift distorsion. It is expected to modify quantitatively the 
result of Section ^ since the lines get thinner or broader 
depending of the exact distribution of the peculiar veloci- 
ties. This must be investigated in detail and is beyond the 
scope of this paper. However, the conclusion of Section |^ 
about the relation between C iv and H i should be valid 
even in real space since velocities modify both profiles in 
the same way. 
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For a polytropic temperature-density relation, T — T p{x)~^^ , 
the optical depth, r(u;), is related to the underlying density via 
Eq. where we set A{z) to 1 for simplicity. Let us explore the 
regime where Eq. ^ is non singular (/3 7^ 0). We argue here that, 
in this regime, at a fixed /3, the inversion problem sketched in 
Section ^ is well paused: i.e. there is a unique solution for the 
underlying density field, given the measured r. 



A.l. Asymptotic Expansion; Gaussian profile 

Let us take the Taylor series of the integrant of Eq. |^ with 
respect to /3 
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Let us also assume for now that the underlying density field 
is Gaussian with an unknown rms, 



p{x) 



1 



27ro- 



■ exp 



(A.2) 



The optical depth associated with this density profile, Eq. |A 
reads : 

.H = i^-)'^<^^'-^M-<^-'l) ( ^W/3^_^+i) , (A.3) 

1/20-^2 + aT(2Gs^ + aT) ^2 (2^42 + aT) ^ 



P{w) 



= T^(log(27r)a + l)a^ - 

2t'^[w^ + [w^a - 3as^) log(27r))a^ - 
To-f ^ [w^a{2 log(7r)a + log(4)a - 4) - 
3cr/(alog(27r)-l))a + 

2(2(74^ +TaY[ - 2aw^ +2crs'^ +Ta) log(cr4)Q - 
8(75^ (a^TO* + aas^{a log(27r) — 5)ui^ + 

log(2^))) . (A.4) 



When /3 = 0, the first term in the bracket of Eq. A.3 vanishes 



the effect of the temperature and the natural broadening of 
the line are degenerate, i.e. all combinations of as and T which 
leave 2as^ + a T inv ariant give the same optical depth profile. 
Thanks to Eq. A.4, when /9 7^ the shape of the line is non 



Gaussian, with a departure from a Gaussian distribution which 
is a function of the underlying temperature of the medium. 
This distortion is characteristic of the temperature-density re- 
lation, and can in principle be disentangled. 

To demonstrate this formally, we may for instance compute 
the variance, (T2,and the kurtosis o"4, of t{w) as a function of 
T and ag (for fixed a and /3) 
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Appendix A: Formal thermal broadening 
deconvolution 
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Solv ing for T in Eq. A.5 and eliminating T subsequently in 
Eq. [A.6| we are left with an implicit equation for as [a2, (T4, /3, a]: 

(6/?(4alog(cri)(log(cri)a + log(27r)a + 3)/3 + 5(3 + 

Q(/31og(27r)(log(27r)a + 6) - 4))a/ + 

2a( - 6/3(4a log((T4)(log(cr6)Q + log(27r)a -f- 2)/3 + 3(3 + 

Q(/3((log^(2) + log(7r) log(47r))a -I- 41og(7r) 

+ log{16)) -2)) aias"" + 

3a (4q log{as)(3 + 2(3 + a(2 log{n)(3 + log(4)/3 - l))al + 
a{2(3log{as)a + (3log{2n)a -a + (3fa4)) = . (A.7) 

As expected, Eq. A.7 becomes singular when /3 = i.e. it be- 
comes: a4 = Saf (this just states that r is also Gaussian and as 
such its reduced kurtosis vanishes) and does not involve as any 
longer. On the other hand, when (3 0, Eq. 



for as and therefore for the corresponding T via Eq. 



A. 7] has a solution 
We 



have therefore demonstrated that it is possible to disentangle 
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natural broadening and thermal broadening for non-singular 
temperature-density relations; in the limit of small /3 but non- 
zero p. 

The abov e argument obviously relies on the Taylor expan- 
sion, Eq. A.l; we have checked that carrying the expansion to 
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/post(M|D) = £(DlM)/prior(M) , 



(B.l) 



1.0^ K I- I- 1. 



0.5 



2nd order did not qualitatively change our conclusions. 



A.2. Finite signal to noise 

In the presence of noise, the actual numerical inversion of a 
given underlying field, p{x), remains a somewhat degenerate 
problem for small values of /3 since the function will depend 
weakly on {as, T) near the minimum (the minimum is "flat"). 
For instance, a relative error of 5% on 02 and 0-4 leads to a rel- 
ative error of 20% for T. We can use Eq. A. 7 to estimate what 
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range of p we need to consider to better constraint the temper- 
ature. A remaining issue is how does the inversion deal with 
lines which have intrinsic kurtosis ? It turns out that statisti- 
cally (ie averaged over a few such lines) we have some insight 
of what the shape of the underlying density profile should be, 
since the relative distortions induced by {3 are the same for 
all lines. To demonstrate this, let us investigate how our non- 
parametric procedure (Section^ recovers the temperature, T, 
for a large range of temperature (5 000 K to 30 000 K) when 
dealing with simulated profiles (provided the simulated data 
span a sufficiently large range of densities). 
For this purpose, we synthetize a line of sight of H I clouds 
whose width and height span the range 40 < fo (km s~^) < 200 
and 3 < pmax < 10. This line of sight is plotted in the lower 



Fig. A.l. line of sight built with a wide range of den- 
sity profiles (lower panel) and the corresponding spec- 
trum computed with a temperature at mean density, T, 
of 15 000 K and /3=0.25 (upper panel). Ten spectra are 
computed with a range of T from 5 000 K to 30 000 K. T 
is then recovered with the method described in Section 



(see Fig. A.2 



can write: 



/p,,t(MlD) = ^exp[-i(D-ff(M))^.C-i.(D-ff(M)) 



panel of Fig. A.l. We generate the corresponding spectrum for 
ten values of T between 5 000 K and 30 000 K while ^ is kept 
constant and equal to 0.25. The spectrum corre spon ding to 
T =15 000 K is plotted in the upper panel of Fig. ^t} The re- 
covered temperature T co rresp onding to a = 1 fit is plotted 
versus the real T in Fig. A.2. For a non-parametric model, a 
X'^ = 1 ± \/'2/N fit does correspond to the best estimate for the 
model. The temperature at mean density is better recovered for 
low T. But the bias towards lower temperature remains small 
up to 30 000 K. This may be due to the increasing distance 
between the true density and the prior (Do). 



Appendix B: The inversion technique 

We aim to invert Eq. i.e. reconstruct the density field pdm. 
To that end, we take a model, g, which basically relates the 
Doppler parameter b and the gas density nm to the dark 
matter density, pdm and the parameters of the temperature- 
density relation. The goal here is to determine these unknown 

fields, M = (\og{£^),T,f3^ , by fitting the data, D = r{wi), 

i.e. the optical depth along the line of sight. 
Since the corresponding inversion problem is under- 
determined, we use the Bayesian technique described in 
Pichon & al (2001). In order to achieve regularization, we 
require a prior guess for the parameters, M, or in statistical 
terms, their probability distribution function, /prior (M). 
Using Bayes' theorem, the conditional probability density 
/post(M|D) for the realization M given the observed data D 
then writes: 



O 




where jC is the likelihood function of the data given the model. 
If we assume that both functions C and /prior are Gaussian, we 



log(T4) 

Fig. A.2. recovered T versus real T from synthetic spec- 
trum described in Fig. ^ (T4 = T/IO^). The value of /3 
is assumed to be known. T is well recovered in the range 
5 000 K - 30 000 K, when the data span a wide range of 
densities. The error bars correspond to = 1 ± \/2/n, 
where n is the number of pixels. For high T, there is a 
small bias to lower T. 
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-i(M-Mo)^-Co-'-(M-Mo)], (B.2) 

with Cd and Co being respectively the covariance "matrix" of 
the noise and of the prior guess for the parameters, Mq. .4 is a 
normalization constant. The superscript, _L, stands for trans- 
position. In a nutshell, the minimum of the argument of the 



exponential in Eq. B.2 is found using an iterative procedure: 



M[fe+i] = Mo + Co • G[fc]^ ■ (Cd + G[k] ■ Co • G[fc]^) ■ 

(D + G[,j ■ (M[fe] - Mo) - g(M[,j)) , (B.3) 

where subscript [k] refers to the iteration order, while G is the 
matrix (or more rigorously, the functional operator) of partial 
derivatives of the model (?(M) with respect to the parameters. 
In this scheme the minimum corresponds to Mf^oj and in prac- 
tice is found via a convergence criterion on the relative changes 
between iteration [k] and [fc -I- 1]. 
Our model reads, Eq. 

g,{p) = A(z)ci J (exp[p(x)])"-'^exp J^-C2 -^^^j— yjy^^ dx , 

where p = log(|g^). The prior is defined as Mo = {po ~ 0). 
The treatment of the temperature-density relation is different 
and it is described in Section ^. Since the mod el, M = p{x) 



in terms 



is a continuous field, we need to interpret Eq. 
of convolutions, and functional derivatives. In particular the 
matrix of partial functional (Frechet) derivatives, G, has the 
following kernel: 

(GUx) = (^^^ {x) = A{z)c^D^-''{x) exp [{a - f3)p{x)] B,{x) , 
with 

B,{x) = ((q-/3) +C22/3(™, -x)'exp[-2/3p(a;)]) • 
^(exp[p(x)])2/3 



exp — C2 



